Planning Under Uncertainty Using Reduced Models: Revisiting Determinization

Authors

  • Luis Enrique Pineda
  • Shlomo Zilberstein
Abstract

We introduce a family of MDP reduced models characterized by two parameters: the maximum number of primary outcomes per action that are fully accounted for and the maximum number of occurrences of the remaining exceptional outcomes that are planned for in advance. Reduced models can be solved much faster using heuristic search algorithms such as LAO*, benefiting from the dramatic reduction in the number of reachable states. The commonly used determinization approach is a special case of this family of reductions, with one primary outcome per action and zero exceptional outcomes per plan. We present a framework to compute the benefits of planning with reduced models, relying on online planning when the number of exceptional outcomes exceeds the bound. Using this framework, we compare the performance of various reduced models and consider the challenge of generating good ones automatically. We show that each of the two dimensions, allowing more than one primary outcome or planning for some limited number of exceptions, could improve performance relative to standard determinization. The results place recent work on determinization in a broader context and lay the foundation for efficient and systematic exploration of the space of MDP model reductions.
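
The two parameters described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The function name `reduce_outcomes`, the `(next_state, probability)` representation, the rule of treating the most likely outcomes as primary, and the renormalization once the exception budget is spent are all assumptions made for illustration; the abstract itself notes that choosing good reductions automatically is an open challenge.

```python
from typing import List, Tuple

# Hypothetical representation: the outcomes of one (state, action) pair,
# given as (next_state, probability) pairs.
Outcomes = List[Tuple[str, float]]
Reduced = List[Tuple[str, int, float]]


def reduce_outcomes(outcomes: Outcomes, num_primary: int,
                    exceptions_used: int, max_exceptions: int) -> Reduced:
    """Return reduced transitions as (next_state, exceptions_used, prob).

    The exception counter is carried alongside the state so that a plan
    can account for at most `max_exceptions` exceptional outcomes.
    """
    if num_primary < 1:
        raise ValueError("at least one primary outcome is required")

    # Assumption: the most likely outcomes are the primary ones.
    ranked = sorted(outcomes, key=lambda o: o[1], reverse=True)
    primary, exceptional = ranked[:num_primary], ranked[num_primary:]

    if exceptions_used < max_exceptions:
        # Budget remains: keep every outcome; an exception consumes budget.
        reduced = [(s, exceptions_used, p) for s, p in primary]
        reduced += [(s, exceptions_used + 1, p) for s, p in exceptional]
        return reduced

    # Budget exhausted: plan only for primary outcomes, renormalized
    # (assumption) so the kept probabilities sum to one.
    total = sum(p for _, p in primary)
    return [(s, exceptions_used, p / total) for s, p in primary]


if __name__ == "__main__":
    outcomes = [("a", 0.7), ("b", 0.2), ("c", 0.1)]
    # One primary outcome and zero exceptions: standard determinization.
    print(reduce_outcomes(outcomes, 1, 0, 0))
    # One primary outcome, one exception allowed and none used so far.
    print(reduce_outcomes(outcomes, 1, 0, 1))
```

With one primary outcome and a zero exception budget, each action keeps a single successor, which matches the determinization special case named in the abstract. Raising either parameter keeps more of the original transition structure at the cost of a larger reachable state space.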

Similar resources

Revisiting the Effects of Growth Uncertainty on Inflation in Iran: An Application of GARCH-in-Mean Models

This paper investigates the relationship between inflation and growth uncertainty in Iran over the period 1988-2008 using quarterly data. We employ a Generalized Autoregressive Conditional Heteroscedasticity in Mean (GARCH-M) model to estimate the time-varying conditional residual variance of growth, a standard measure of growth uncertainty. The empirical evidence shows that growth uncertain...

Probabilistic Planning with Reduced Models

Markov decision processes (MDPs) (Puterman 1994) offer a rich model that has been extensively used by the AI community for planning and learning under uncertainty. Some applications include planning for mobile robots, network management, optimizing software on mobile phones, and managing water levels of river reservoirs. MDPs have polynomial complexity in the size of the state space, but the sta...

Optimizing Electric Vehicle Charging Through Determinization

We propose a determinization-based approach to optimize the charging policies of an electric vehicle (EV) operating in a vehicle-to-grid (V2G) setting. By planning when to charge the vehicle or discharge electricity from it, the long-term cost of operating the EV can be minimized while remaining consistent with the owner’s preferences. For an EV operating under price uncertainty caused by the dynamic...

Trajectory-Based Short-Sighted Probabilistic Planning

Probabilistic planning captures the uncertainty of plan execution by probabilistically modeling the effects of actions in the environment, and therefore the probability of reaching different states from a given state and action. In order to compute a solution for a probabilistic planning problem, planners need to manage the uncertainty associated with the different paths from the initial state ...

A Single Machine Capacitated Production Planning Problem Under Uncertainty: A Grey Linear Programming Approach

Production planning is an important problem in most manufacturing systems. Contrary to what many existing studies assume, this problem is subject to considerable uncertainty in its parameters and input data. In this paper, a single machine capacitated production planning problem is considered and a linear programming formulation is presented. The production costs are assumed to be u...

Publication date: 2014